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Abstract 

We show that gamma distributions provide models for departures from randomness since every 
neighbourhood of an exponential distribution contains a neighbourhood of gamma distributions, 
using an information theoretic metric topology. We derive also the information geometry of the 3- 
manifold of McKay bivariate gamma distributions, which can provide a metrization of departures 
from randomness and departures from independence for bivariate processes. The curvature objects 
are derived, including those on three submanifolds. As in the case of bivariate normal manifolds, we 
have negative scalar curvature but here it is not constant and we show how it depends on correlation. 
These results have applications, for example, in the characterization of stochastic materials. 



1 Gamma distributions and randomness 

The family of gamma probability density functions is given by 

{p(x;/3,a)=(jj\ ^ e -f x \a,(3eR + }, xeR + (1.1) 

so the space of parameters is topologically M + x K + . It is an exponential family and it includes as a 
special case (a = 1) the exponential distribution itself, which complements the Poisson process on a line. 
It is pertinent to our interests that the property of having sample standard deviation independent of the 
mean actually characterizes gamma distributions, as shown recently by Hwang and Hu ^Oj- They proved, 
for n > 3 independent positive random variables x%, x?, ■ ■ ■ , x n with a common continuous probability 
density function /, that having independence of the sample mean x and sample coefficient of variation 
cv = S/x is equivalent to / being a gamma distribution. Of course, the exponential distribution has unit 
coefficient of variation. 

The univariate gamma distribution is widely used to model processes involving a continuous positive 
random variable. Its information geometry is known and has been applied recently to represent and 
metrize departures from randomness of, for example, the processes that allocate gaps between occurrences 
of each amino acid along a protein chain within the Saccharomyces cerevisiae genome, see Cai et al 0], 
clustering of galaxies and communications, Dodson 00 El- In fact, we can make rather precise the 
statement that around every random process on the real line there is a neighbourhood of processes 
governed by the gamma distribution, so gamma distributions can approximate any small enough departure 
from randomness. 



Proposition 1.1 Every neighbourhood of an exponential distribution contains a neighbourhood of gamma 
distributions, using the subspace topology o/R 3 and information theoretic immersions. 

Proof: Dodson and Matsuzoe [5] have provided an affine immersion in Euclidean R 3 for Q, the manifold 
of gamma distributions with Fisher information metric. The coordinates (O 1 ^ 2 ) = (// = a/P,a) form a 
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Figure 1: Affine immersion in natural coordinates n = a/ ft, a as a surface in M 3 for the gamma manifold 
G; the tubular neighbourhood surrounds all exponential distributions — these lie on the curve a = 1 in the 
surface. 

natural coordinate system (cf Amari and Nagaoka |2j) for the gamma manifold Q . Then Q can be realized 
in Euclidean R 3 by the graph of the affine immersion where £ is the transversal vector field along 



and a tubular neighbourhood of this curve will contain all immersions for small enough perturbations 
of exponential distributions. In Figure ^ this is depicted in natural coordinates p., a and in Figure |21 
the corresponding surface and tubular neighbourhood (not here an affine immersion, just a continuous 
image) is shown in the usual (a, (3) coordinates of the gamma family The tubular neighbourhood in 

Figure [3 intersects with the gamma manifold immersion to yield the required neighbourhood of gamma 
distributions, which completes our proof. □ 

A simple transformation of random variable x G M. + in i|l.l|) to N — e~ x £ [0, 1] converts a gamma 
distribution into a log-gamma distribution, which turns out to have the same geometry. 

Proposition 1.2 (Dodson [7j) The family of log- gamma probability density functions 



h HE!: 




The submanifold of exponential distributions is represented by the curve 



(0,oo)-.M 3 :/i^{ M ,l,log-} 



{g(N,a,[3) 



^(frqog^) 

T(a) 



\a,[3e M.+ }, Ne [0,1 



(1.2) 



determines a Riemannian manifold C with information-theoretic metric having the properties: 

• £ contains the uniform distribution as the limit: lirng^i g(N, [3, 1) = g(N, 1, 1) = 1 

• C contains approximations to truncated Normal distributions for (3 > > 1 

• C is isometrically eguivalent to the gamma manifold Q . 



□ 



Through this isometry and the result of Dodson and Matsuzoe [S] we have an immersion in R 3 that 
represents also the log-gamma manifold and, since the isometry sends the exponential distribution to the 
uniform distribution on [0, 1], we obtain another deduction: 
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Figure 2: Continuous image of the affine immersion in Figure Q as a surface in R 3 using standard 
coordinates for the gamma manifold Q ; the tubular neighbourhood surrounds all exponential distributions — 
these lie on the curve a — 1 in the surface. 



Proposition 1.3 Every neighbourhood of the uniform distribution on [0,1] contains a neighbourhood of 
log-gamma distributions. □ 

The value of such topological results lies in the fact that they have qualitative consequences that are 
stable under small perturbations of a process, something that would be important in real applications. It 
gives confidence in the use of gamma distributions to model near random processes. Moreover, the fact 
that we have an information-theoretic metric topology for the neighbourhoods lends significance to the 
result. 

For comparison purposes, we recall that the information geometry of the univariate and multivariate 
normal distributions also are known, see for example Lauritzen [Tl in Amari et al pQ for the univariate 
case, Sato et al ^B] for the bivariate case and Skovgaard ^7] for the multivariate case. We note in 
particular that the univariate and bivariate normal distributions have constant negative scalar curvature, 
so geometrically they constitute parts of pseudospheres. 

First consider the 3-parameter family of univariate gamma distributions with density function : 

g(x;0,a, 7 )= (~j e'^ 1 , x > 7 > 0, 0,a>O. (1.3) 

Evidently, the extra parameter 7 > is a location shift and when 7 = we recover the univariate gamma 
distribution ll.lfl . and when a = 1 we obtain the exponential distributions with two parameters. The 
mean x, standard deviation a x , and coefficient of variation cv x , for l|1.3[) are given by 

x = p + 7, a x = — , cv x 



y/a(0 + 7) ' 

The distribution l|1.3(l gives us a slight generalisation of the gamma distribution (|l.l(l , which we use in 
the sequel. 

One of the earliest forms of the bivariate gamma distribution is due to Mckay defined by the density 
function 

(.{ai+otn) x <x-l — l(y _ x \ a 2 — lg — cy 

f(x,y) = — — — — r defined on y > x > , a u c,a 2 > (1.4) 

r(aijr(a2) 



The marginal distributions are gamma with shape parameters ct\ and a± + «2 , respectively. The covari- 



4 



Neighbourhoods oj randomness and geometry oj McKay bivamate gamma 3-manijold 



ance Cov and correlation coefficient p of X and Y are given by : 



Cov(X,Y) = ^ (1.5) 



p(X,Y) = J— (1.6) 
V a\ + a 2 

2 Bivariate 3-parameter gamma 5-manifold 

In this section we introduce a bivariate gamma distribution which is a slight generalization of that due 
to Mckay, by substituting (x — 71) for x and (y — 72) for y in equation (|1.4|) . We call this a bivariate 
3-parameter gamma distribution, because the marginal functions are univariate 3-parameter gamma 
distributions. Then we consider the bivariate 3-parameter gamma models as a Riemannian 5-manifold. 
The Christoffel symbols have been calculated but are not listed because they are somewhat cumbersome. 



Proposition 2.1 Let X and Y be continuous random variables, then 

T{ax)T{a 2 ) 

defined on (y — 72) > (x — 71) > , ai, c, a 2 > 0, 71, 72 > 0, is a density function. The covariance and 
marginal density functions, of X and Y are given by: 

0-X2 = % (2.8) 
fx(x) = 1 ^ , , x> 7 i>0 (2.9) 

My) - ly r T 2j , , y>72>o (2.10) 

1 {ax + a 2 ) 

□ 

Note that the marginal functions fx and fy are univariate 3-parameter gamma distributions with pa- 
rameters (c, ax, 71) and (c, ai -I- a 2l 72), where 71 and 72 are location parameters. We shall refer to 12.7( 1 
as giving the bivariate 3-parameter gamma distributions. 

It is easily shown that c = > so the bivariate 3-parameter gamma distribution (|2.7j) can be presented 

in the form: 



/(*,») = M 2 " ^-^""^-^^-^^ e-^^), (2.11) 
V<7i2/ r(ai)r(a 2 ) 

defined on (y — 72) > [x — 71) > 0, with parameters ax,a 2 , oxi > 0, and 71, 72 > 0. 
Proposition 2.2 Lei M* be the set of bivariate 3-parameter gamma distributions, that is 

VCT12/ r(ai)r(a 2 ) 
(y-72) > (a: -71) > 0, ax,a 2 > 2, a 12 > 0, 71,72 > 0.} (2.12) 

Then we have : 

1. Identifying {ax, a%, o~x 2 , 71, 72) as 0, local coordinate system, M* can be regarded as a 5-manifold. 

2. M* is a Riemannian manifold with Fisher information matrix G = [gij] where 

°° r d 2 \ogf{x iy ;x\xi,x 3 ,x\x 5 ) 1 2 3 4 5 W , 

/ t. ip. J\x,y,x ,x ,x ,x ,x ) dy dx 

71 J x— 71+72 ® X 
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and x 1 — ai, x 2 — a 2 , x 3 
is given by : 



V'(ai) 



-3 Q1+Q2 
4 a\ 2 



2 ai 

Q1-Q2 
4 ai 



(-1+ai) yjaii 
-1 



o"i2, £ 



2ai 

V'(a 2 ) 
1 

2(712 



(1—0:2) V'is 



7i, ^ 



72- 



2 y/cn y^i2 (—1+0:2) V CT i2 

w/jere = ^ (i = 1,2). 



ai — Q 2 
4 ai (T12 
1 

2(712 

ai +Q2 

4(T12 2 



T 

2(T12 2 



(-l+a:i) \Z&12 
(1 — 0:2) \/Tl2 





+ 



(-2+Oi)<Tl2 (-2+a 2 )(7l2 

(-2+Q2) CT12 



2 ^/aTf \/ci2 
(-1+0:2) V CT 12 

r 

2(712? 
-Ol 

(— 2+a 2 ) CT12 
ai 

(— 2+0:2) CT12 



(2.13) 



□ 



3 Mckay bivariate gamma 3-manifold 

In this section we consider the Mckay bivariate gamma model as a 3-manifold, equipped with Fisher 
information as Riemannian metric, and derive the induced geometry, i.e., the Ricci tensor, the scalar 
curvatures etc; the Christoffel symbols were computed but are omitted here. In addition, we consider 
three submanifolds as special cases, and discuss their geometrical structure. 

3.0.1 Fisher information metric 

The classical family of Mckay bivariate gamma distributions is given by: 



(°l+°2) 



f{x,y;ai,a 12 ,a 2 ) = 



(£r)^-^ Q1_1 (i/-aO 



T(a 1 )T(a 2 ) 



(3.14) 



defined onO<i<j/<oo with parameters ai,a\ 2 ,a 2 > 0. Where a\ 2 is the covariance of X and Y. 
The correlation coefficient and marginal functions, of X and Y are given by : 



P(X,Y) 



ai 



a.i + a 2 



fx(x) = ^ 



' ., \j "12 



r(ai) 



x > 



(3.15) 



(3.16) 



Mv) 



,(«!+( 



-Ig-V^ 1 ' 



-, y >o 



r(ai + a 2 ) 

Note that it is not possible to choose parameters such that both marginal functions are exponential. 
Proposition 3.1 Let M be the set of Mckay bivariate gamma distributions, that is 



(3.17) 



M 



{f\f(x,y;a!,a 12 ,a 2 ) 



( — ) 

\ (7io / 



(°l+c 2 ) 



L (y - x) 



02-1, 



r(ai)r(a 2 ) 



y > x > 0, ai, (T12, a2 > 0} 

Then we have : 

1. Identifying (ai, <J\ 2 , a 2 ) as a local coordinate system, M is a 3-manifold. 

2. M is a Riemannian 3-manifold with Fisher information metric G = [gij] given by : 



(3.18) 



- —3 0-1+0-2 



[9ij] = 



i 0:1+0:2 1 MjaA ai ~ a2 i_ 

TaP r V V"U 4ai CT12 2ai 

01—02 01+02 1 

4 ai <7i2 4 CT12 2 2 (7i2 



(3.19) 
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3. The inverse \g % i] of [gij] is given by: 



g 11 



g 



g 22 



-1 + (ai + a 2 ) tp'(a 2 ) 



ip>(a 2 ) + V'(ai) (1 - ("i + a 2 ) V {<**)) 
\2 _ 21 _ °"i2 (1 + {on ~ ot2) ^'(a 2 )) 



ai (ip'(a 2 ) + V'(ai) (1 - (an + a 2 ) ip'(a 2 ))) 



13 _ „31 _ L 



-ip'(a 2 ) + ip'icn) (-1 + (ai + a 2 ) ip'{a 2 )) ' 
Q-12 2 (-1+ (~3ai +Q 2 +4ai 2 V) / (g 1 )) ^ / (« 2 )) 
ai 2 (-#(a 2 ) + (-1 + (ai + a 2 ) V>'(a 2 ))) ' 



■23 32 _ gig (-1 + 2aiV' , (Q;i)) 

ai (V''(a 2 ) + ^)'(ai) (1 - (ai + a 2 ) ^'("2))) ' 

33 _ ( -1 + (ai +a 2 ) ip'jai) \ , , 

9 ~ U'(«2)+V'("i) (l-(ai+a 2 ) V'(« 2 ))y ' 1 J 



□ 



3.0.2 Curvature properties 



We provide the various curvature objects of the McKay 3-manifold M; the Christoffel symbols are known 
but they are omitted here. 



Proposition 3.2 The components of the curvature tensor Rijki are given by: 



-R1212 
-R1213 

-Rl223 
^1313 
-Rl323 



ip'(a 2 ) (#(ai) + (<*i + <* 2 ) V>"(Q!i)) 

16 (Ti 2 2 (^'(ai) + ^'("2) - (ai + a 2 ) ip' (ai) ip'(a 2 )) ' 

V>'(a 2 ) (V> , (ai) + 2a 1 y» // (a 1 )) 

16 ai (712 (ip'(ai) + tp'{a 2 ) - (ai + a 2 ) V'(ai) ^'("2)) ' 

-^(ai) ?//(a 2 ) 

16a 12 2 (V>'(ai) + V'(a2)-(ai+a 2 ) V'(ai)^(a 2 )) ' 
- (- (V/K) y/(a 2 )) + ((3 ai - a 2 ) + W y/'(ai)) V>"(a 2 )) 

16 ai 2 (V>'(ai) + i>'(a 2 ) - (c*i + a 2 ) ^'("2)) 
-^'(a 2 ) (V>'(a 2 ) + (-ai + a 2 ) V>"(a 2 )) 
16 ai 0-12 {ip'(ai) + ip'(a 2 ) - (ai + a 2 ) V'( a i) 4>'{ol 2 )) ' 



d ^(ai) (y/(a 2 ) + (ai + a 2 ) tp"(a 2 )) , - 

2X2323 - le^ 2 (^(a 1 ) + ^(a 2 )-(a 1 +a 2 )^(a 1 )^'(a 2 ))' ^ J 



wMe t/ie of/ier independent components are zero. 



□ 
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Proposition 3.3 The components of the Ricci tensor are given by the symmetric matrix R = [R^ 

-3 V'(c*i) 2 V'(a 2 ) - 3 V'(« 2 ) 2 



R 



li 



16 ax (ip'(ai) + ip'{a 2 ) - (a\ + a 2 ) ip'(cti) ip'{a 2 )) 2 

cti tp'jai) 2 i>'{a 2 ) + a 2 ip' (ai) ip' (a 2 ) 2 

16ai 2 (^'(ai) + ip'(a 2 ) - (ai + a 2 ) V'M V-'M) 2 

^(ai) 2 y/(a 2 ) 2 - 2 fjo^) jj'(a 2 ) ^"( ai ) | 
4 (V-'(ai) + V'(a 2 ) - (ai + a 2 ) V'(ai) V''(« 2 )) 2 
- (ag V»'(a 2 ) + a 2 2 y/(a 2 ) 2 y,"( ai ) 

16 ai 2 (V»'(ai) + V'(a 2 ) - (ai + a 2 ) ^'(ai) V-'M) 2 
ai ^(ai) V' / (ct 2 ) 2 V>"(ai) + a 2 V>'(ai) V>'(a 2 ) 2 tp"(ai) 

4 (V>'(ai) + ^'(a 2 ) - (ai + a 2 ) V'(ai) V^)) 2 
3^ , (a 2 )V> /, (ai) + 3^ , (ai)^ /, (a 2 ) 
16 ai (i>'(ai) + tp'(a 2 ) - (ai + a 2 ) ij)'{a\) tp'(a 2 )) 
-3 ^( a2 ) 2 ^(ai) - 3 ^'( ai ) 2 y/'(g 2 ) | 
16 (^(ai) + V'(a 2 ) - («i + « 2 ) ^'(<*i) ^'M) 2 



+ 



- (a 2 ~ip'{a 2 ) 2 ip"{ai)j - a 2 ip' (ai) 2 ip" (a 2 ) 

8 ai {i>'(ai) + ip'(a 2 ) - (ai + a 2 ) V''(a 2 )) 2 

- (ag ^(ai) ^(a 2 )) + a 2 2 ^(ai) 2 V>"(a 2 ) 
16 ai 2 (^'(ai) + i>'(a2) - (ai + a 2 ) ^'(^i) ^'(a^)) 2 
V>"(ai) ^(a 2 ) - ai ^(ai) ^(ai) V>"(a 2 ) - a 2 V^(ai) ^(ai) V>"(a 2 ) 
4 (V>'(ai) + V'(a 2 ) - (ai + o 2 ) ^'(«i) V'(a 2 )) 2 



V'(ai) V-'M + V'(ai) V-'M - V>'M 

Kl 2 = 2 + 

16 (Ti 2 (i>'{a{) + ip'(a 2 ) - (ai + a 2 ) ^'("2)) 



+ 



- (a 2 ^'(ai) 2 "0'(a 2 )) - " 2 ^'("l) ^'( a 2 ) 2 + a 2 ^'("2) 4>"{ot\) 

16 ai cti 2 (^'(ai) + ip'{a 2 ) - (ai + a 2 ) ^'(a^)) 2 
Qi '0'(a 2 ) 2 ip"{ct\) - ip'(ai) ip"(a 2 ) + ai ip'{ai) 2 ip"(a 2 ) 
16 cri2 + ip'(a 2 ) - (ai + a 2 ) V'(ai) V''(a 2 )) 

- (a 2 2 7//(a 2 ) 2 + a 2 ^"M - a 2 2 <//(ai) 2 V"M 

16 ai ci 2 (^'(ai) + i/>'(a 2 ) - (ai + a 2 ) V'(ai) ^'(a 2 )) 2 

- (^'(ai) 2 V'(a 2 )) - ^'(ai) V'(« 2 ) 2 + ^(a 2 ) ^"(01) 

^?13 — 2^ 

8 ai (^'(ai) + ^'("2) - (ai + a 2 ) V'(ai) ^'("2)) 
^'(a 2 ) 2 (^"( ai ) + 2^( ai ) 2 ) +^"(a 2 )(^(ai) 2 +2^"(ai)) 
8 (^'(ai) + V'(a 2 ) - (ai + a 2 ) ^'("i) ^'(a 2 )) 2 

- (a 2 V(a 2 ) 2 ^"(ai)) + V'(ai) ^"(a 2 ) - a 2 ^{ ai f V"(a 2 ) 

8ai (V>'(ai) + ^'(a 2 ) - (ai + a 2 ) 4>'(a\) ip'(a 2 )) 2 



+ 



(ai + a 2 ) (V' , (a 2 ) (^(ai) (V^Qi) + ^(a 2 )) - ^"(ai) + (gi + a 2 ) V^Qgj ^"(^1))) 

16ai 2 2 (V»'(ai) + V'(a 2 ) - (ai + a 2 ) ^'(ai) i/-'(a 2 )) 2 
(ai + a 2 ) (V^Qi) (-1 + (ai + a 2 ) ^(ai)) V' ,/ (a 2 )) 
16 cna 2 (V>'(ai) + V'M - (ai + a 2 ) ^'(ai) ^'(a 2 )) 2 ' 
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yj'{a 2 ) (£(ai) {fjai) + ^(a 2 )) - ^(ai) + (gi + a 2 ) V{a 2 ) */>"(ai)) 
8CT12 (ip'(ai) + ip'(a 2 ) - (a x + a 2 ) V'(ai) V''^)) 2 
Vy(ai) (-1 + (ai + a 2 ) ^(ap) ^"(a 2 ) 
8o"i2 (i>'(aii) +ip'(a 2 ) - (ax + a 2 ) ip'{ax) ip'(a 2 )) 2 



^ -2 V-'(ai) V'(« 2 ) + (ai + a 2 ) ^'(a 3 ) (V («i) 2 - V^'(ai)) + </>"(<*! )) ^"M 

-R33 — 9 h 

4 + ^'(a 2 ) - (ai + "2) V>'(a 2 )) 

V' / (ai) 2 ^ / (a2) 2 , 3 22 s 

4 (^(ai) + ^'(a 2 ) - {ax + a 2 ) ^'(ai) V'M) 2 ' 

□ 



Proposition 3.4 TTie scalar curvature R of M is given by: 

R = ^(ai) 2 ^(a 2 ) + ip'(ai) ^'{a 2 ) 2 + ai ^{a 2 f f^x) 
2 (ip'(ax) + ip'{a 2 ) - [ax + a 2 ) ^'M) 2 
-(^(a 2 )V' ,, (ai))-V' , (ai)V' ,, (a2) | 
(V>'(ai) + ^'{ot 2 ) - (ai + a 2 ) ip'(ai) ip'{a 2 )) 2 
a 2 t^'{a 2 ) 2 ip"(ax) + ol\ ip'{ax) 2 ip"{a 2 ) + a 2 ip'(ctx) 2 i[>"(a 2 ) 
2 {ip'iax) + ip'(a 2 ) - {ax + a 2 ) ip>(ctx) ip'{a 2 )) 2 
- {ax rjax) r{a2)) - a 2 f(ai)fM (g 2g) 

2 (V»'(ai) + V'M - (ai + a 2 ) ^'(ai) V>'M) 2 ' 

T7ws /ias limiting value us ai,«2 ^ 0. □ 
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Proposition 3.5 The sectional curvatures of M are given by: 

-ip'(a 2 ) (i>'{ai) + {a x + a 2 ) i>"(ai)) 



0(1,2) = 
0(1,3) = 



4 (-1 + (ai + a 2 ) V'(ai)) (^'("l) + ^'("2) - (oji + oj 2 ) ^'("2)) ' 

- (^(ai) V/(a- 2 ) + ((-3 ax + a 2 ) - 4 ai 2 ^"(ai)) ip"{a 2 )) 

4 (^'(ai) + ip'(a 2 ) - (ai + as) ^'(^2)) (-1 + (a 2 + «i (-3 + 4ai ^'(ati))) ^'(a 2 )) ' 

/„ - -yy(ai) (^(a 2 ) + (ai + a 2 ) ^"(a 2 )) , > 

~~ 4 (_l + ( ai +a 2 ) V/(a 2 )) (^'(ai)+V'(a2)-(ai+a 2 ) ^'(ai) V'M) ' l ' ' 



Proposition 3.6 The mean curvatures g(X) (A = 1,2,3) are given by : 



e(2) 



3ai ^'(ai) 2 "0'(a2) + a 2 ?/;'(ai) 2 ip'(a 2 ) — 3ai ^'(aa) ip'(a 2 ) 2 



0(3) = 



8 (a 2 +ai (-3 + 4ai V'(ai))) (V>'(ai) + V'(a 2 ) - (ai + a 2 ) ^'(ai)^'^)) 2 

a 2 ^'(ctx) ^'(a^) + 4ai 2 ^'(ai) 2 ip'(a 2 ) + 3 ai ip'{a 2 ) ip"(cti) ^ 
8 (a 2 + ai (-3 + 4ai V'(ai))) W>'(ai) + V>'(a 2 ) - (ai + a 2 ) ^'(c^)) 2 

- (a 2 VA« 2 ) V»"(<*i)) - 8 Ql 2 ^(aQ ^(a 2 ) y,"( ai ) - 3 o^ 2 ^(a 2 ) 2 ^(aQ | 
8 (a 2 + ai (-3 + 4 ai ip'(ai))) (4>'(ati) + ip'(a 2 ) - (a x + a 2 ) V'(ai) ^'(c^)) 2 
-2 ai a 2 y/(a 2 ) 2 y/>i) + a- 2 2 y/(« 2 ) 2 V>"(ai) + 4a! 3 ^'(ag) y/(a 2 ) 2 y/'(ai) 
8 (a 2 + ai (-3 + 4ai V'(ai))) (^'("l) + ^'("2) - («i + a 2 ) ^'(<*i) ip'(u 2 )) 2 

4 ai 2 a 2 yy(ai) y/(a- 2 ) 2 jfjai) + 3 gi y/(ai) V/'(a 2 ) - « 2 ^(gi) V^M 
8 (a 2 + ai (-3 + 4ai^'(ai))) (^'(ai) + ^'("2) - («i + a 2 ) i>' (a^ ip' (a 2 )) 2 

-3Qi 2 ^(ai) 2 ^ ,, (a 2 ) - 2aia 2 ^'K) 2 ^"M + a 2 2 *P'(aif tp"(a 2 ) 
8 (a 2 + a 1 (-3 + 4a 1 ^'(a 1 ))) (^'(ai) + ^'(a 2 ) - (a x + a 2 ) V'(<*i) ^'(o^)) 2 
ai 2 ^(ai) ^"(a 2 ) - Qi 3 ^(ai) ?//'(ai) V>"(a 2 ) - ai 2 a 2 *A'(ai) ^"(ai) ^"M 
2 (a 2 + ai (-3 + 4ai^'(ai))) {^'{a x ) + t/j'(a 2 ) - (ai + a 2 ) V'((*i) V^)) 2 ' 
^(a 2 ) (^(ai) (^(ai) + ^(a 2 )) - ^"(ai) + (ai + a 2 ) ^(a 2 ) ^"(ai)) | 
8 (^(ai) + ^'(a 2 ) - (ai + a 2 ) x//( ai ) ^'(« 2 )) 2 
V>'(ai) (-1 + (ai + a 2 ) V>'(ai)) ^"(02) 
8 (^'(oi) + ^'(a 2 ) - (ai + a 2 ) V>'(ai) ^'M) 2 ' 

2 V'(ai) V'(a 2 ) + (ai + a 2 ) ^'(a 2 ) (v>'(ai) 2 - ^"(ai)) + V"(«a) 
8 V'(a 2 ) (V-'(ai) + ^'(a 2 ) - («i + a a ) ^'(«i) ^'H)) 2 

^(ai) 2 ^(a 2 ) 2 

8^ / (cx 2 )(V''(ai)+V'(«2)-(ai + Q!2) ^' (a^ ^ (a 2 )) 2 ' 



□ 



(3.25) 

□ 



4 Submanifolds of the Mckay 3-manifold M 

We consider three submanifolds Mi , M2 and M3 of the 3-manifold M of Mckay bivariate gamma distri- 
butions (|3.18(l f(x, y; a\, <T\ 2 , a 2 ), where we use the coordinate system (ptx, a\ 2 , a 2 ). These submanifolds 
have dimension 2 and so it follows that the scalar curvature is twice the Gaussian curvature, R = 2K. 
Recall from above that the correlation is given by 

I ai 

p = \ — ; — • 

V ot\+ a 2 

In the cases of M\ and M 2 the scalar curvature can be shown as a function only of p. 
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4.1 Submanifold M 1 C M: a x = 1 

The distributions are of form: 



(— ^— ) ^ (y — x)" 2 e 
f(x, y; 1, C7i2, a 2 ) = — 



r(a 2 ) 



(4.26) 



denned onO<i<i/<oo with parameters (7i2,a2 > 0. The correlation coefficient and marginal 
functions, of X and Y are given by : 



P{X,Y) 
fx(x) 

My) 



i 



a/1 + a 2 

——— e v^T2 ) x > 

\/ cr 12 

(- -7= ) ( 1+Q2 ) y Q2 e~ v^S 4 
«2 T(a 2 ) 



y>0 



(4.27) 
(4.28) 

(4.29) 



So here we have a 2 = p2 ° , which in practice would give a measure of the variability not due to the 
correlation. 



Proposition 4.1 The metric tensor [gij] and its inverse [g^] are as follows : 



G=[giA = 



r l+a 2 1 
4(Ti2 2 2a 12 

2k V>'M 



(4.30) 



G- 1 = 



-2cri2 



4 ffi2 2 ip' (012) _ _ _ 

-l+(l+a 2 )i/>'(a 2 ) -l+(l+a 2 ) f (a 2 ) 
-2 CT12 1+0:2 



-l+(l+a 2 ) if,' (a 2 ) -1+(1+Q 2 )^'("2) . 



(4.31) 
□ 



Proposition 4.2 The Christoffel symbols of Mi are 

-A -I- 1 

-l + (l+a 2 ) V'(C2) 



r 1 
1 11 



r 1 
1 12 



r 1 
1 22 



r 2 
r 



r 2 
1 22 



4(712 



r 1 - 
1 21 — 



11 — 

2 _ 

12 — 



-2 + 2 (l + a 2 ) 4>'(a 2 ) ' 
gi2'0 ,/ (a2) \ 
-l + (l + a 2 ) Vict*)) ' 

-(l + a 2 ) 

ia 12 2 (-l + (l + a 2 ) ^'(a 2 )) ' 



r 2 - 
1 01 — 



-1 



21 4a 12 (_i + (i + a 2 ) V'(a2)) 
(l + a 2 ) V"("2) 



-2 + 2 (l + a 2 ) V'M 



(4.32) 

□ 



Proposition 4.3 T/ie curvature tensor of Mi is given by 

- (V'(a 2 ) + (1 + a 2 ) V"(a 2 )) 



i? 



1212 — 



16(Ti2 2 (-l + (l + a 2 ) ^'(a 2 )) 



(4.33) 



luMe the other independent components are zero. 
By contraction we obtain: 
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Figure 4: The scalar curvature R as a function of correlation p for McKay submanifolds: M\ (M with 
a\ = 1) where R increases from — | to 0, and M 2 (M with a 2 — 1) where R decreases from to — |. 



Ricci tensor: 



Scalar curvature: 



Ru — 

Rl2 = 



(l + ag) (^(a 2 ) + (1 + a 2 ) ^"M) 
16cr 12 2 (-l + (l + a 2 ) V"(a 2 )) 2 
?/>'(a 2 ) + (1 + a 2 ) ^"(a 2 ) 

8<ri2(-l + (l + a 2 ) ip'{a 2 )f ' 



^ = ^'M (^> 2 ) + (l + a 2 ) ^(a 2 )) (4 34) 

4(-l + (l + a 2 ) V'(« 2 )) 2 



* = V^M±(i±^) g>g) (4.35) 
2(-l + (l + a 2 ) ^(a 2 ) f 

□ 



4.2 Submanifold M 2 C M: a 2 = 1 

The distributions are of form: 



(^.)^-i ai -V 



/(x, y; ai , (7i2, 1) - ^ , (4.36) 

defined on0<s<j/<oo with parameters ai,(7i 2 > 0. The correlation coefficient and marginal 
functions, of X and Y are given by : 



V 1 + cti 

fx(x) = W - , x>0 (4.38) 

r(ai) 

My) - - air(ai) . ^>o (4-39) 



Here we have ai = 



12 



Neighbourhoods oj randomness and geometry of McKay bivariate gamma 3-manijold 



Proposition 4.4 The metric tensor [gij] and its inverse [g l i] are as follows 



G = hj] = 



' \ ' 4 CKi <7\2 



4ai- 



-1+tti 

4 Qi CT12 



l±Ol 
4(T12 2 



G- 1 = [gv] = 



l+Ql 



-l+(l+ai)V'(ai) 

(-l+ai)eri2 
ai (-l+(l+ai) V'(ai)) 



(-1+ai) a-12 



v ai (-l+(l+ai)V'(ai)) 
<T12 2 (l+ai (-3+4an/.'(ai))) 
m 2 (-l+(l+ai) V'(ai)) 



(4.40) 

(4.41) 
□ 



Proposition 4.5 The Christoffel symbols are 

-1 + ai (3 + 4 ai (1 + ai) V"(ai)) 



1 1 1 — 



12 



r 1 

1 22 



r 1 

1 9 



p2 

111 — 



12 



8ai 2 (_i + (i + ai) V'(ai)) ' 

= -(-1 + ai) 

21 ~ 8 ai (712 (-1 + (1 + ai) V'(ai)) ' 

+ 

8a 12 2 (_l + (l + ai) V'K)) ' 

ai2 + (-3 + 8^(01) -4 (-1 + ai) aii/>"(ai))) 
8ai 3 (_i + (i + ai) V"(ai)) 
1 + ai (-3 + 4ai^'(ai)) 



8ai 2 (_i + (i + ai) V'K))' 



r 2 
1 22 



-8 + 



ai (-l+(l+ai)f(ai)) 
8(Ti2 



Proposition 4.6 The curvature tensor is given by 

(V'(oi) + (l + ai) V"("i)) 



-R1212 — 



16ai 2 2 (-l + (l + ai) V'(ai))' 

ly/w/e i/ie other independent components are zero. 

By contraction we obtain: 
Ricci tensor: 



(4.42) 
□ 

(4.43) 



R21 
R22 



Scalar curvature: 



(1 + ai (-3 + 4ai^(ai))) (^(ai) + (1 + Qi) V>"(ai)) 

16ai 2 (-l + (l + ai) ^'(ai)) 2 
(-1 + ai) (^(ai) + (1 + Qi) ^"(ai)) 

16ai<ri 2 (-l + (l + ai) V-'(ai)) 2 
(1 + ai) (^ / (Q 1 ) + (l + a 1 ) V> /; (ai)) 

16ai 2 2 (-l + (l + ai) ^'( ai )) 2 

^(ai) + (l + ai) V"(ai) 



i? 



2(-l + (l + ai) V'(ai)) 



2 • 



(4.44) 

(4.45) 
□ 



4.3 Submanifold M 3 C M: a 12 = 1 

tic 

( ai y 



Here the distributions have unit covariance and are of form 

, (01+02) 

f{x,y;a 1 , l,a 2 ) 



ai_1 (j/ - x)" 2 - 1 e-\ /sr 2' 



r(ai)r(a 2 ) 



(4.46) 
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defined onO<a;<y<oo with parameters ct\, a 2 > 0. The correlation coefficient and marginal functions, 
of X and Y are given by : 



p(X,Y) = J-^— (4.47) 
V a.\ + a 2 

fx(x) = Vai X r{ai) e , x>0 (4.48) 

/— (ai+a 2 ) , (ai+a 2 )-l p — v/aTl/ 

™ = rfa • v> ° (4 ' 49) 



Proposition 4.7 77ie metric tensor [gij] and its inverse [g^] are as follows: 

2a[ 



G = [ gij ] = 



V'(a 2 ) 



G- 1 = foW] 



4 «i 2 ip'(a 2 ) 2 a; 

-l + (a 2 +cti (— 3+4ai i/>' '(ai))) i/>'("2) — l+(ct 2 +ai (— 3+4ai V'(ai))) i/>'(a 2 ) 

(-3+4ai V>'(ai)) 



-l + (a 2 +ai (-3+4ai V'(ai))) tp'(a 2 ) -l+(a 2 +ai (-3+4ai ^'(ai))) i/>'(a 2 ) 



Proposition 4.8 The Christoffel symbols are 



(4.50) 



(4.51) 



□ 



! 3 + 2^(a 2 ) (3ai-2a 2 + 4ai 3 y/'(ai)) 

11 '"' 4ai (-l + (a 2 + ai (-3 + 4aiV'(ai))) ^'("2)) ' 
r i ai^ // (a 2 ) 

22 ~ -1 + (a 2 + ai (-3 + 4 ai ^'(oi))) ^'(a 2 ) ' 

r i pi = ^'M 

12 21 -2 + 2 (a 2 +ai (-3 + 4ai^(ai))) V'(o!2) ' 
r2 _ -a 2 + ai (-3 + 12ai V'(ci!i) +8ai 2 V"(Q!i)) 

11 = 8 ai 2 (-l + ( a2 + ai (-3 + 4aiV'(ai))) #(a 2 )) ' 
r2 (a 2 + ai (-3 + 4 ai ^(ai))) V^agj 

22 -2 + 2 (a 2 + ai (-3 + 4ai^'(ai))) V'(a 2 ) ' 

p2 p2 ^ 

12 21 ~ 4ai (-1 + (a 2 + ai (-3 + 4«i V'(ai))) ^'(a 2 )) ' 

□ 



Proposition 4.9 TTie curvature tensor is given by 

-^'(a 2 ) + (-a 2 + ai (-3 + 12 Ql ^(aQ + 8 o^ 2 V>"(ai))) V> w (a 2 ) , , 

1212 " 16a! 2 (-l + (a 2 + ax (-3 + 4«i V'(ai))) ^'(a 2 )) ' l ' j 

ly/w/e i/ie other independent components are zero. 

By contraction we obtain: 
Ricci tensor: 

(a 2 + ai (-3 + 4aiV'(ai))) ((-a 2 + ai (-3 + 12 ttl V'(c*i) + 8 c*i 2 V"(ai))) ^"M) 

-fill — o 

-16ai 2 (-l + (a 2 +ai (-3 + 4ai V'K))) ^'M) 
^'(a 2 ) (a 2 + «i (-3 + 4ai V'(«i))) (^'("2)) 



16ai 2 (-l + (a 2 + ai (-3 + 4 ai V'(ai))) ^'M) 2 ' 
-i)'(a 2 ) + (-a 2 + a 1 (-3 + 12 ai V'(ai) + 8"i 2 ^"(c*i))) ^"("2) 

-Rl2 = T J 

8ai(-l + (a 2 + ai (-3 + 4a x V'(ai))) ^(a 2 )) 
^ = ^(a 2 ) (^(a 2 ) + (ag + ai_(3-4ai (3_^(ai) + 2ai V^'M))) V>"(a 2 )) 
4(-l + (a 2 + ai (-3 + 4aiV'(ai))) ^'M) 2 
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Neighbourhoods oj randomness and geometry oj McKay bivariate gamma 3-manijold 




Figure 5: The scalar curvature R for McKay submanifold M3, (M with a 12 = 1). 
Scalar curvature: 

<//(a 2 ) + (a 2 + ai (3-4ai (3 V'(ai) + 2«i V"(ai)))) ip"(a 2 ) 

K = 5 . (4.55) 

2 (-1 + (a 2 + ai (-3 + 4ai ^(ai))) ip'(a 2 )) 2 

□ 



5 Applications 

The univariate gamma information geometry is known and has been applied recently to represent and 
metrize departures from randomness of, for example, the processes that allocate gaps between occurrences 
of each amino acid along a protein chain within the Saccharomyces cerevisiae genome, see Cai et al pi], 
cosmological void distribution and clustering of galaxies, and communications, Dodson [51[7||S]. 

The new results on bivariate gamma geometry have potential applications in any situation where positively 
correlated variables < x < y < 00 are used to model a process with marginal gamma distributions. In 
some of the applications mentioned above, there are other variables associated and these may yield some 
refinements of the existing models. Two new case studies are being developed at present, both involve 
bivariate data from measurements on stochastic porous media. 

The first application concerns the structure of paper and nonwoven textiles, in the manufacture of which 
fibres are deposited on a continuous filter bed and form a near-planar bonded network. In the random 
case it is easily seen that the mean number of sides is four for the polygonal voids formed by the fibre 
process. The distribution of polygonal void sizes is then given by the direct product of independent 
identical exponential distributions — reflecting the Poisson processes for fibre intersections. For isotropic 
but non-random manufacturing processes, Dodson and Sampson |2] used the product of two gamma 
distributions to obtain the void size distribution, recovering the known random model as a special case. 
For some such isotropic materials it may be appropriate to consider correlated polygon sides, since 
the voids tend to be 'roundish' suggesting positive correlation among polygon sides. Some commercial 
processes involve preferential alignment of fibres in the direction of manufacture, resulting in anisotropic 
void distributions. In these cases the product of independent gamma distributions needs to be replaced 
by the McKay bivariate model and we shall report this study elsewhere. 

The second application concerns tomographic images of soil samples in hydrology surveys. Such images 
yield 3-dimensional reconstructions of the porous structure and interestingly these generate a bivariate 
positively correlated pair of random variables < x < y < 00. Here, the larger variable represents the 
size of a void and the smaller variable represents the sizes of the throats that connect it to neighbouring 
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voids. This work is being pursued in collaboration with Professor J. Scharkanski of the Federal University 
of Rio Grande do Sul, Brazil and will be reported elsewhere 

The authors used Mathematica to perform analytic calculations and can make available working notebooks 
for others to use. 

6 Concluding remarks 

We have formalised the concept of a process being 'nearly random' by proving that in the subspace 
information metric topology of Euclidean R 3 , every neighbourhood of an exponential distribution contains 
a neighbourhood of gamma distributions. We have derived the information geometry of the 3-manifold M 
of McKay bivariate gamma distributions, which can provide a metrization of departures from randomness 
and independence for bivariate processes. Additionally, we give the metric for a 5-manifold version that 
includes location parameters for the two random variables. The curvature objects are derived for M and 
those on three submanifolds that illustrate some cases of possible practical interest. As in the case of 
bivariate normal manifolds, we have negative scalar curvature on the McKay bivariate gamma manifolds, 
but here it is not constant and we show how it depends on correlation in two cases. These results have 
applications, for example, in the characterization of stochastic materials. 
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